User Behaviour Analysis Based on Time Spent on Web Pages
نویسنده
چکیده
Nowadays Internet and network related service and content providers try to collect valuable service usage data and process it using different methods to know their user’s behaviours. Heretofore quality of online contents is measured with number of page impressions (a request to load a single page of an Internet site) or number of hits (refers to a request for a file from a web server). But both indicators regard to the quality of incoming links rather than quality of contents. Time spent on web pages (TSP) gives a more usable description of quality of pages where TSP is useful time as long as user is on given page. In this paper, we provide a clustering approach to make groups of similar Web pages by distributions of spent times. The distribution of spent time is different at dissimilar types of pages (e.g. registration forms, index pages, news, descriptions of products). But difference has other meaning if page is in the same type. These points to the fact that pages in same type with different distributions influence otherwise reading strategy of users. This may mean that page with different distribution is better or worse than other pages in the same type, independently of hits or impressions. Understanding users’ reading strategy brings us nearer up to measure the quality of contents more exactly. In addition we describe an approach wherewith we are able to eliminate the effects of the stateless status of HTTP protocol and derive useful spent time in the preprocessing step by dissociate the different user activities. To this we define three user activities and connected time distributions: reading, searching links and backward stepping. Our approaches are tested on log files generated by a commercial website.
منابع مشابه
A New Hybrid Method for Web Pages Ranking in Search Engines
There are many algorithms for optimizing the search engine results, ranking takes place according to one or more parameters such as; Backward Links, Forward Links, Content, click through rate and etc. The quality and performance of these algorithms depend on the listed parameters. The ranking is one of the most important components of the search engine that represents the degree of the vitality...
متن کاملتشخیص ناهنجاری روی وب از طریق ایجاد پروفایل کاربرد دسترسی
Due to increasing in cyber-attacks, the need for web servers attack detection technique has drawn attentions today. Unfortunately, many available security solutions are inefficient in identifying web-based attacks. The main aim of this study is to detect abnormal web navigations based on web usage profiles. In this paper, comparing scrolling behavior of a normal user with an attacker, and simu...
متن کاملWeb pages ranking algorithm based on reinforcement learning and user feedback
The main challenge of a search engine is ranking web documents to provide the best response to a user`s query. Despite the huge number of the extracted results for user`s query, only a small number of the first results are examined by users; therefore, the insertion of the related results in the first ranks is of great importance. In this paper, a ranking algorithm based on the reinforcement le...
متن کاملMining Navigation Histories for User Need Recognition
The time spent using a web browser on a wide variety of tasks such as research activities, shopping or planning holidays is relevant. Web pages visited by users contain important hints about their interests, but empirical evaluations show that almost 40-50% of the elements of the web pages can be considered irrelevant w.r.t. the user interests driving the browsing activity. Moreover, pages migh...
متن کاملUse of Semantic Similarity and Web Usage Mining to Alleviate the Drawbacks of User-Based Collaborative Filtering Recommender Systems
One of the most famous methods for recommendation is user-based Collaborative Filtering (CF). This system compares active user’s items rating with historical rating records of other users to find similar users and recommending items which seems interesting to these similar users and have not been rated by the active user. As a way of computing recommendations, the ultimate goal of the user-ba...
متن کاملذخیره در منابع من
با ذخیره ی این منبع در منابع من، دسترسی به آن را برای استفاده های بعدی آسان تر کنید
عنوان ژورنال:
دوره شماره
صفحات -
تاریخ انتشار 2008